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(57) Abstract: The present invention relates to a polynudeotide eric(.Kling ihc corncodcsmosin protein having one. or more uacleolidc 
inse]1ioi7S, deletions or substitutions at one or novel posilions Tbc invention also relates to the comeodcsniosin p.rotein having one 
or more amino acid insertions, deletions and substitutions These nucleotide and amino acid polymiorphisms arc useful in diagnosi ng 
or delciTOining susceptibility to comcodesmosin mediated disease, for example inflammaloiy diseases including psoriasis, and in 
treating such disease Tlost cells and ti-ansgenic non-human animals coinpiising polynucleotides or proteins of the invention are 
1^ providetl Methods of screening for agents for use in lieating comeodcsmo sin mediated disease arc also provided 
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Test and Model for Iiiflarmnatoiy Disease 

The present mvention relates to nucleotide substitutions, deletions or insertions in the 
corneodesmosin gene, and the exploitation of these polymorphisms in the detection 
5 and/or treatment of corneodesmosin mediated disease, for example inflammatory 
diseases including psoriasis „ The present invention also relates to polynucleotides 
encoding the corneodesmosin protein, and having one or more nucleotide 
polymorphisms, and to a protein encoded by said polynucleotides. Also provided 
are transgenic non-human animals comprising the polynucleotides of the present 
10 invention; and methods and kits for treating, diagnosing or determining susceptibility 
to corneodesmosin mediated disease, m particular by way of gene therapy. 

In recent years, it has been recognised that there is considerable genetic diversity in 
human populations, with common polymoipliisms occmiing on average at least every 
15 kilobase in the genome. Polymorplnsms which affect gene expression or activity of 
the encoded gene product may account for susceptibility to, or expression of, disease 
conditions, either directly or through interaction with other genetic and environmental 
factors. 

20 Understanding the molecular basis for disease, by sequencing the human genome and 
chai'acterising polymorphisms, will enable tlie identification of those individuals at 
greatest risk of disease. This will allow the better matching of treatment and disease, 
and enable the production of new and improved, targets for drugs. Screening and 
treatment of disease may also be better targeted to those in need, thus increasing the 

25 cost-effectiveness of health-care provision. 



One area in need of such approaches is the diagnosis and treatment of inflammatory 
diseases- Inflammation, which can be broadly defined as tire deshmctive sequelae to 
activation of elements of the body*s inminne system, is a feature of many diseases 
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mcluding infection, autoimmune disordeJ:s and benign, and malignant hyperplasia. Hie 
identification of gGnetic factors which inflnence susceptibility to such disorders will 
provide important new insights into inflammatoiy disease, and may 3a eld important 
new diagnostic and/or prognostic tests and treatments.. 

5 

Psoriasis is a chronic inflammatory cutaneous disorder which affects approximately 
2% of the population in the UK and US.. Psoriasis manifests itself as red scaly skin 
patches, principally on the scalp, elbows and laiees, and is caused by epidennal 
hyperproliferation, and abnormal differentiation and infiltration of inflannnatoiy cells. 
10 Psoriasis may also be associated with other inflammatoiy diseases such as arthritis, 
Crolm's disease, and HIV infection. Population, family, and twin studies all suggest 
an important genetic component in die pathogenesis of psoriasis, coupled wifli 
environmental triggers such as streptococcal infection and stress. 

Psoriasis is one of a number of autoinnnune diseases that display significant honian 
leukocyte antigen (HLA) associations. The analysis of population-specific HLA 
haplotypes hi^ provided evidence that susceptibihty to psoriasis is linked to the class I 
and n major Instocompatibility complexes (MHC) on human chromosome 6 (Jcnisch 
et al (1998) Am. 1 Hum. Genet 63:191-199). These studies show that psoriasis 
consists of two distinct disease subtypes (Type I and Type H), which differ in age of 
onset and in the frequency of HLA types. Type I psoriasis has an age of onset of 
prior to 40 yeai"s and HLA types Cw6, B57, and DR7 are strongly increased. Patients 
with Type I psoriasis are much more likely to have a positive family history for the 
disease. In contrast, only about 10% of Cw6-'positive individuals develop Type II 
psoriasis disease, vidth HLA-Cw2 being over-represented in this group. 

Linkage analysis and association studies suggest the presence of a major genetic 
determinant of psoriasis within the MHC, the strongest candidate gene marker being 
HLA-C. The most significant association has been shown between HLA-Cw<; and 
30 disease Type lA, which has the earliest onset of disease at 0 to 20 yeai-s. However, 
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specific involvemeDt of the HXA-Cw6 genotype m disease pathogenesis has yet to be 
establislied 

Recently, attention has focussed on non-HLA genes close to HLA -C, in parti cnlar the 
5 comeodesmosin gene (also known as the S gene), which is located approximately 1 60 
kb telomeric of the HLA-C locus. The comeodesmosin gene consists of 2 exons 
spanning approxinnatdy 53 kb of genomic DNA sequence. Two comeodesmosin 
niRNAs of 2.2kb and 2,6kb, resulting from alternative splicing, have been described 
(Guenin ei al (1998) ./ Biol Chem... 273:22640-22647), Association studies (Ahnini 
10 et al (1999^ Hum, Mol Genet. 8:1135-1140) suggest a strong, significant association 
between a polymorphism at position 1243 of the comeodesmosin gene and psoriasis, 
A comeodesmosin gene haplotype was subsequently defined^ winch by TDT analysis 
was shown to have a strong, significant association with psoriasis (Allen et al (1999) 
Lancet 353:1589-90), 

15 

In human epidermis and other cornified squamous epithelia, comeodesmosin is 
located in the desmosomes of the upper living layers, and in related structures of the 
cornified layers^ the comeodesmosomes. Dming maturation of the cornified layers, 
the protein undergoes a series of cleavages, thought to be a prerequisite of 

20 desquamation (shedding of the cuticle or epidermis), Comeodesmosin is detected as a 
glycovSylated and phosphorylated basic protein with an apparent molecular mass of 52- 
56 kDa.. During stratum comeum maturation, comeodesmOvSin is progressively 
proteolysed until desquamation occurs., In superficial corneocytes, the 52-56 kDa 
form is no longer detected iind immunoreactive lr?:gments of 45 to 30 kDa 

25 predominate. Since location, biochemical characteristics and processing of 
comeodesmosin are similar in several maromals, it is likely that the protein is essential 
for the fanction of comeodesmosomes and corneocyte coliesion. It has been shown 
that expression of the 56kDa epidennal keratin polypeptide is increased in psoriatic 
lesions compared with normal skin and transformation of desmosomes into 

30 comeodesmosomes is altered in psoriatic epidermis. 
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Psoriasis affects approxitnately 6.4 millioii people m die US and causes vaiyiiig langes 
of physical discomfort, pain and disability. At present, the causes of psoriasis are 
unknown. There is no specific test for psoriasis or susceptibihty Uiereto, and diagnosis 
5 is based solely on clinical examination and skin histopathology. 

It is likely that comeodcsmosin is implicated in a range of sldn diseases, including 
psoriasis. In this text, diseases in wliich corneodesmosin is miphcated in tiie 
pathology will be referred to as "coraeodesmosin-mediated disease"., 

10 

The present invention aims to overcome or ameliorate previous limitations in the art 
by providing means and methods for the detection and treatment of individuals having, 
or being susceptible to, corneodesmosin mediated disease, in particular inflanunatoiy 
conditions such as psoriasis. 

15 

hi a first aspect, the present invention provides an isolated or recombinant 
polyimcleotide comprisijig a nucleic acid sequence encoding the comeodcsmosin gene 
of Figure 1, wherein said nucleic acid sequence comprises a nucleotide substitution, 
deletion or insertion at one or more of positions 6984, 7068, 7077, 7107, 7164, 8884, 

20 8906, 8931, 9538, 9607, 9608, 9647, 9667, 9745, 9761, 9926, 9952, 9968, 10082, 
10161, 10162, 10363, 11567, 11641, 11649, 11808, 11839, 11885, 11977, 12018, 
12136, 12149, 12198, 12283, 12318, 12345, 12373, 12901, 13001, 13020, 13108, 
13117, 13178, 13224, 13316, 13365, 13562, 13605, 13670, 13859, 13889 and 13914 
of Figure L These novel polymorphisms in the comeodesmosin gene, at the positions 

25 indicated above, may be responsible for corneodesmosin mediated disease. In 
particular, the polymorphisms of the present invention may be useful in identifying 
individuals susceptible or resistant to corneodesmosin-mediated disease, and in the 
diagnosis or treatment of such conditions. Preferred combinatioiis of the 
polymorphisms of the invention are the haplotypes shown in Tables 10a and b. The 

30 most prefeixed haplotype is B of Tab3e 1 Oa, 
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The polynucleotide of tliis invention is preferably DNA, or may be RNA or other 

options. 

5 By "isolated" is memt a polynucleotide sequence which has been purified to a level 
sufficient to allow allelic discrimination. For example, an isolated sequence will be 
substantially free of any otliet^ DNA or protein product. Such isolated sequences may 
be obtained by PGR amplification, cloning techniques, or synthesis on a synthesiser. 
By recombinant is meant polynucleotides which have been recombined by the hand of 
10 man. 

The comeodesmosin gone sequence shown in Figure 1 refers to the genomic clone of 
comeodesmosin, detailed in GenBank Accession No. AC006163 ( a genomic clone of 
the MHC region on chromosome 6p2L.3), The single nucleotide polymorphisms of 

15 the invention are shown m bold typt and nnderlhicd on this figure, and have each been 
given a positional reference with respect to this figure. For reference aiid comparison 
with piior art pubhcations, the positional references with respect to the coding 
sequence have also been given in Table 6, column 2, where nucleotide position 1 
corresponds to the iirst nucleotide of exon 1 and nucleotides upstream of this are given 

20 a negative prefix. 

A polymorpMsm is typically defined as two or more alternative sequences, or alleles, 
of a gene in a population. A polymorphic site is the location in the gene at which 
divergence m sequence occurs. Examples of the ways in which polymorphisms tire 

25 rnanifested include restriction fragment lengtli polymori^lusms, variable number of 
tandem repeats, hypervariable regions, minisatellites, di- or multi-nucleotide repeats, 
insertion elements and nucleotide deletions, additions or substitutions. The first 
identified allele is usually referred to as the reference allele, or the wild type. 
Additional alleles arc usually designated alternative or variant alleles. Herein, the 

30 sequence exactly as shown in Figure 1 is designated the reference sequence, and is not 
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part of liie invention. Nucleic acid sequences of the x)resent invention whicli differ 
fi'om the sequence of Figure 1 at one or more of the positions indicated above may be 
refeired to as variants of Figui'e 1 . 

5 A single nucleotide polymorphism is a variation in sequence between alleles at a site 
occupied by a single nucleotide residue. Single nucleotide polymorphisms (SNP's) 
arise from the substitution, deletion or insertion of a nucleotide residue at a 
polymoiplnc site. Typically, this results in the site of the valiant sequence being 
occupied by any base other than the reference base,. For example, where the reference 

10 sequence contains a "T" base at a polymorphic site,, a variant may contain a "C", "G" 
or "A" at that site, Single nucleotide polymorpliisms may result in corresponding 
changes to the amino acid sequence. For example, substitution of a nucleotide residue 
may change the codon, resultmg in an amino acid change. Similarly, the deletion or 
insertion of tln:ee consecutive bases in the nucleic acid sequence may result in the 

15 insertion or deletion of an amino acid residue. For ease of reference, where a single 
nucleotide poljonorphism of the present invention results in the insertion or deletion of 
a nucleotide or amino acid residue, the numbering system of Figures 1 and 2 have 
been maintained, 

20 The single nucleotide polymorphisms of the present invention which occur within the 
protein coding sequence may contribute to the phenotype of an oiganism by affecting 
protein structure or function. The effect may be neutral, beneficial or detiimental, 
depending upon the circumstances. Whatever the effect, the identification of such 
polymorphisms enables for Ibe first time detemiination of susceptibility to disease, and 

25 new methods of treatment. The single nucleotide polymorphisms of the invention 
which occur in the non-coding 5' or 3' untranslated regions, may not affect protein 
sequence, but may exert phenotypic effects by influencing rephcation, transcription 
and/or translation.. A polymorphism may affect more than one phenotypic trait or may 
be related to a specific phenotype. In the present invention, poiymoiphisms in the 

30 comeodcsxnosin gone are Ulcely to affect the phenotype of an individual with respect to 
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comeodesmosin-mediated disease, such as inflatnmatory disease, in particular 

psoriasis. 

The single nucleotide polymoiphisms of the comeodesmosin gene, including ttiose of 
5 the present inventioii, are listed in Table 6 where: 

• Column 1 designates each single nucleotide polymorphism a reference number, 

• Column 2 provides the positional reference of the polymorplnsm with respect to 
10 Figure 1, 

• Column 3 indicates position of the SNP with respect to the comeodesmosin coding 
sequence.. 

15 • Column 4 shows Uie location of the polymorphisms in the gene. 

• Column 5 shows the sequence flanlcing the polymorphism, tlie polymorphism itself 
being shown in bold type. For example, tlie polymorphism at position 6984 is 
shown as C/T, meaning that the variant sequence comprises a T residue, rather 

20 than the native C residue. 

• Colunmi 6 denotes the standard lUB code. 

As discussed above^ where a single nucleotide polymorphism of the present invention 
25 comprises a nucleotide substitution, the substitution may comprise the replacement of 
the reference base at a polymorphic site with any other base. Each single nucleotide 
polymorphism described in Table 6, column 4 represents a preferred embodiment of 
the invention. 
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It will be appreciated by those skilled in the art that comcodesmosiii gene sequences of 
the invention may comprise one or more nucleotide substitutions, deletions or 
inseilions in addition to one or more of the single nucleotide polymorphisms of the 
invention. 

5 I 

In a second aspect, fragments of the above polynucleotides are provided^ which 
comprise one or more nucleotide substitutions^ insertions or deletions at one or more 
of iho above mentioned positions of Figme L Preferably, a fragment may comprise, 
or even consist of, the polynucleotide sequence of Table 6, column 4. The novdty of 
10 a fragment according to the present embodiment may be easily ascertained by 
comparing the nucleotide sequence of a fragment with sequences catalogued in 
databases such 'as GenBank, or by using computer programs such as DNASIS (Hitachi 
Engineering, Inc.) or Word Seaj'ch or FASTA of the Genetic Computer Group 
(Madison, Wl). 

15 

Preferably, the fragments do not encode a full length protein, as is generally tiie case 
with the aforementioned polynucleotides, but othei-wise satisfy the requirements of the 
first aspect. Preferred fragments may be 10 to 150 nucleotides in length. More 
preferably, the fragments are between 5 tolO, 5 to 20, 10 to 20, 20 to 50, or 50 to 100 
20 nucleotides in length. For example, the fragments may be 5, 8, 10, 12, 15, 18, 20, 22, 
25, 28, 30, or 35 nucleotides m length. The fragments may be useful in a variety of 
diagnostic, prognostic or therapeutic methods, or may be useful as reseai:cli tools for- 
example in drug screening. 

25 In a third aspect of the invention, there is provided non-coding, colmplemcntary 
sequences wliich hybridise to the comeodesmosin gene sequence. Such *'anti-sense" 
sequences are useful as probes or primers for detecting an allele of a polymorphism of 
the invention, or in the regulation of the corneodesmosm gene. They may also be used 
as agents for use in the identification and/or Ixeatment of individuals having or being 

30 susceptible to comeodesmosin mediated disease. 
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The anti-sense sequences of the invention inchide those wliich hybridise to an allele of 
a polyinorpliism of the invention, and also those wliich hybridise a region flanldng the 
polymoipMc site to enable amplification of an allele of one or more polymorpliisnis.. 
5 These sequences may be usefiil as probes or prkners. To be useful as a probe, the anti- 
sense sequence should bmd preferentially one allele of one or more polymorphisms of 
the present invention and will, preferably, comprise the exact complement of one 
allele of one or more polymoipliisms of the invention. Thus, for example, where the 
Viiriant comprises a ''G" residue at position 7068 of Figure 1, it is preferred that the 

10 anti-sense sequence will comprise a residue. Such anti-sense sequences which 
are capable of specific hybridisation to detect a single base mis-match may be 
designed according to methods Icnown in the ait and described in Manialis et al, 
Moleadar Cloning: A Laboratory Manual Edition (1989), Cold Sprmg Harbor, 
NY and Berger et al. Methods in Enzyniology 152: Guide to Molecular Cloning 

15 Techniques (1987) Academic Press Inc. San Diego, CA; Gibbs ei aL, Nuc Acids Res., 
17: 2437 (1989); Kwok et al, Nuc Acids Res 18: 999; and Miyada et al. Methods 
EnzymoL 154: 94 (1987). Variation in the sequence of these anti-sense sequence is 
acceptable for tlie purposes of the present invention, provided that tlie ability of the 
anti-sense sequence to distinguish between alleles of a polymorphism is not 

20 compromised. Similarly, variation in the sequence of a primer sequence is acceptable, 
provided its abihty to mediate amplification of apolymorpliic site is not compromised. 
Preferably, a primer^ sequence will hybridise to the comeqdcsmosin gene under 
stringent conditions which are defined below^ 

25 In relation to the present invention, "stringent conditions'' refers to the wasliing 
conditions used in a hybridisation protocol hi general, the washing conditions should 
be a combination of temperature and salt concentration so that the dcnaturation 
temperature is approximately 5 to 20''C below the calculated of the nucleic acid 
under study„ The Tj,, of a nucleic acid probe of 20 bases or less is calculated mider 
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Standard conditions (IM NaCl) as [4°C x (CtVC) + 2^C x (A+T)], according to Wallace 
rules for short oligoiTucleotides. For longer DNA fragments, the nearest neighbour 
method, wliich combines solid thermodynamics and experimental data may be used, 
according to the principles set out in Bieslaucr et al, FNAS 83: 3746-3750 (1986), 
5 The optimum salt and temperatnre conditions for hybridisation may be readily 
determined in prelhninary experinients in which DNA samples immobilised on filters 
are hybridised to the probe of interest and then washed mider conditions of different 
stringencies. W3.iile the conditions for PGR may differ from the standard conditions^ 
the Tm may be used as a guide for the expected relative stabihty of the primers. For 
10 short primers of approximately 14 nncleotides, low aimealing temperatures of around 
44'^C to SO^^C are used. The temperature may be higher depending upon the base 
composition of the primer sequence used. 

The anti- sense polyiiacleotides of this embodiment may be the full length of the 
15 corneodesmosin gene of figure 1, or more preferably may be 5 to 200 nucleotides in 
length. Preferred polynucleotides are 5 to 10, 10 to 20, 20 to 50, 50 to 100 or 100 to 
200 nucleotides in length. Primers, in particular, arc typically 10 to 15 nucleotides 
long, and may occ£^ionally be 16 to 25. 

20 In a preferred embodiment, the polynucleotides of the aforementioned aspects of tire 
Uivention may be in the fomi of a vector, to enable the in vitro or in vivo expression of 
the polynucleotide sequence. The polynucleotides may be operably linked to one or 
more regulatory elements includuig a promoter; regions upstream or downstream of a 
promoter such as enhancers which regulate the activity of the promoter; an origin of 

25 rephcation; appropriate restriction sites to enable cloning of inserts adjacent to the 
polynncleotide sequence; markers, for example antibiotic resistance genes; ribosome 
binding sites: RNA splice sites and transcription teimination regions; polymerisation 
sites; or any other element which may facilitate the cloning and/or expression of the 
polynucleotide sequence. Where two or more polynucleotides of the invention are 
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mti'oduced into the same vector, each may be controlled by its own regulatory 
sequences, or all sequences may be contiolled by the same regulatory sequences. In 
the same manner, each sequence may comprise a 3' polyadcnylation site. The vectors 
may be introduced into microbial, yeast or animal DNA, either clnomosomal oi' 
5 mitochondrial, or may exist independently as plasrajds* Examples of suitable vectors 
will be known to persons slcilled in the ait and include pBluescript H, LambdaZap, and 
pCMV-Script (Stratagene Cloning Systems, La Jolla (USA)) 

Appropriate regulatory elements, in particular, promoters will usually depend upon the 
1 0 host cell hito which the expression vector is to be inserted. Where microbial host cells 
are used, promoters such as the lactose promoter system, tryptophan (Trp) promoter 
system, p-lactamase promoter system or phage lambda promoter system are suitable. 
Where yeast cells are used, preferred promoters include alcohol dehydrogenase I or 
giycolylic promoters, hi mammalian host cells, prefeired promoters are those derived 
1 5 from unmunoglobulin genes, SV40, Adenovirus, Bovine Papilloma virus etc. Suitable 
promoters for use in various host cells would be readily apparent to a person sldlled m 
tlie art (See, for example. Current Protocols m Molecular Biology Edited by Ausubcl 
et al, published by Wiley). 

20 hi a fourth aspect of the present invention there is provided a protein or protein 
fragment comprising an amino acid substitution, deletion or insertion at one or more of 
positions 18, 130 or 180 of the amino acid sequence of Figure 2. Preferably, the 
protein or protein fragment is encoded by a polynucleotide according to the first aspect 
of the invention, and comprises a nucleotide insertion, deletion or substitution at one 

25 or more of positions 7164, 10082, 10161, 10162 aiid 10363 of Figure 1, The 
comeodesmosin protein or protein fragments of the invention may comprise one or 
more polymorphisms in addition to one or more of the above-mentioned 
polymorphisms of Fignre 2. 



wo 01/62788 



12 



PCT/GBOl/00795 



The amino acid sequence exactly as shown in Figure 2 may be referred to as the 
reference sequence, and is not part of the mvcntion. The amino acid sequence of 
Figure 2 having an amino acid substilution, deletion or iiiscitioii at one or moi'e of the 
positions indicated above may be refen'ed to as a vati^int of Figure 2> The reference 
5 amino acid at one or more of the above polymorphic sites may be replaced by any 
other amino acid residue to produce a variant sequence. Amino acid sequences of 
Figure 2 having one or more of the polymoiphisms disclosed in Table 4 are each 
preferred embodiments of the hivention. 

10 Protem fragments may be functional or non-functional and may be useful in drag 
screening or gene therapy. Fmictional fragments may be defined as those which have 
characteristics of the comeodesmosm protein. The fragments may be at least 10, 
preferably at least 15, 20, 25 30, 35, 40 or 50 amino acids in length. 

15 In a fifth aspect of the present invention, there are provided antibodies which react 
witJ.i an antigen of a protein or protein fragment of the fourth aspect., Antibodies can 
be made by the procedure set forth by standard procedures (Harlow and Lane^ 
"Antibodies; A Laboratory Manual" Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1998). Briefly, purified antigen can be injected into an animal in 

20 an amount and in intervals sufficient to elicit an hnmune response. Antibodies can 
either be purified directly, or spleen cells can be obtained from the animal The cells 
are then fused with an immortal cell line and screened for antibody secretion. The 
antibodies can be used to screen DNA clone libraries for cells secreting the antigen. 
Those positive clones can flien be sequenced as described in, for example, Kelly at aL, 

25 Bio/Technology 10:163467 (1992) and Bebbington et al, Bio/Technology 10:169- 
175 (1992), Preferably, the antigen being detected and/or used to generate a particular 
antibody will include proteins or protem fragments according to the fourth aspect. 

In a sixth aspect of the present invention, there is provided host cell comprising a 
30 polynucleotide according to any of the aforementioned aspects, for expression of the 
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polynucleotide. The host cell may comprise an expression vector, or naked DNA 
encoding said polynucleotides. A wide variety of suitable host cells are available, 
both eukarj'otic and prokaryotic. Examples include bacteria such as E. colU yeasty 
jSlanientons fungi, insect cells, maniinalian cells, preferably iimnoitalised, such as 
5 mouse, CHO, HeLa^ myeloma or Jmkat cell lines, human and monkey cell lines and 
derivatives thei-eof Such host cells are useful in drag sci^eening systems to identify 
agents for use in diagnosis or treatnent of individuals having, or being susceptible to 
corneodesmosiii mediated disease, 

10 The method by which said polynucleotides are introduced into a host cell will usually 
depend upon the nature of both the vector/DNA and the target cell, and will include 
those knowi to a person sldlled in flie art. Suitable loaown methods include fusion, 
conjugation, transfection, transductioiL, electroporation or injection, as described in 
Sambrook et ah 

15 

In an seventh aspect of the present invention, there is provided a tiansgenic non- 
human animal comprising a polynucleotide according to an aforementioned aspect of 
the invention, Prefer^ibly, tlie transgenic, non-human animal comprises a 
polynucleotide according to the first or second aspects. The transgenic animal may be 

20 either homozygous or heterozygous for the variant sequence. The animal, and cells 
derived therefi'om, are useful for screening biologically active agents that may 
modulate corneodesmosin function. Such screening methods are of particular use for 
determining the specificity and action of potential therapies for corneodesmosin 
mediated disease, such as psoriasis.. The animals are useful as a model to investigate 

25 the role of comeodesmosin in normal sldn function. Transgenic non-human ammals 
are also usefiil for flie analysis of the single nucleotide polymorphisms and then 
phenotypic effect , 

Expression of a polynucleotide of the invention in a tiransgemc non-human animal is 
30 usually achieved by operably Unking Uie polynucleotide to a promoter and/or enhancer 
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sequence, preferably to produce a vector of the fourth aspect, and introducing this into 
cm embryonic stem cell of a host animal by niicroinjection techniques (ITogan et aL, A 
Laboratory Manual, Cold Spring harbour and Capccclii Science (1989) 244: 12S8- 
1292),, Preferably, the construct to be introduced into the animal additionally 
5 comprises a) a first homology region with substantial identity to a first 
corneodesmosin gene sequence; and b) a second homology with substantial identity to 
a second corneodesmosin gene sequence.. The first and second homology regions are 
of sufficient lengtli for homologous recombination to occur with an endogenous 
corneodesmosin gene. Those embryonic stem cells comprising the desired 

10 polynucleotide sequence may be selected, usually by niotiitoring expression of a 
marker gene, and used to generate a non-human ti^ansgenic animaL Preferred host 
animals include mice and other rodents, Fuifher development of such an embryonic 
stem cell may produce a transgenic animal having cells that are descendant from the 
enibiyonic stem cell and thus carry the variant sequence in their genome. Such 

15 ajiinials can theji be selected and bred to produce animals havijig the variant sequence 
in all somatic and genu cells. Such mice can then be bred to homozygosity. 

In a prefeixed embodmient, the transgenic non-human animal may comprise an anti- 
sense nucleic acid sequence of the tlurd ^pect. The expression of an anti-sense 
20 sequence in a transgenic non-human animal may be useful in detcnnining the effects 
of such sequences in treating comeodesmosin- mediated disease, or in neutrahsing 
deleterious effects of vaiiant comeodesmosin genes in a3i animal Preferably , the host 
animal will be one which suffers from corneodesmosin mediated disease. The disease 
may be naturally occurring or artificially introduced. 

25 

In some preferred embodiments, for example where the mediated disease has been 
artificially induced, the transgenic noiiTiunian animal will be modulated to no longer 
expresses the endogenous comeodesmosin gene. Such animals may be refeijed to as 
"knock out". In some cases, it may be appropriate to modulate the expression of the 
30 endogenous comeodesmosin gene, or express flie polynucleotides of the present 
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iiiventioTi, in specific tissues. Tiiis approach removes viability problems if the 
expression of a gone is abolished or induced in all tissues. Preferably, the specific 
tissue would be skin. Where the heterologous gene is hiunan, the animal may be 
usefiil in identifying agents which inhibit expression oi' activity of the vaiiaat 
5 comeodesmosin sequences of the invention, cither in vivo or in vitro. 

In an eighth aspect of the present invention there is provided a metliod of sci'ecning for 
agents for use in the prognosis, diagnosis or treatment of individuals having, or being 
susceptible to, comeodesmosin-mediated disease, said method comprising contacting a 

10 putative agent with a polynucleotide or protein according to an aforementioned aspect 
of the present uivention, and monitoring the reaction there between. Preferably, the 
metliod further comprises contacting a putative agent with a leference polynucleotide 
or protein of Figure 1 or 2 respectively, and comparing the reaction between (i) the 
agent and the polynucleotide or protein encoding the reference allele; and (ii) the agent 

15 and polynucleotide or protein of the invention. Potential agents are those which react 
differently wifli a vmimi of the invention and a reference allele. It is envisaged that 
the present method may be carried out by contactuig a putative agent with a host cell 
or transgenic non-human animal comprising a polynucleotide or protein according to 
tlie invention. Putative agents will include those known to persons skilled in the art, 

20 and include chemical or biological compounds, such as anti-sense polynucleotide 
sequences, complementar)-^ to the coding sequences of the first aspect, or polyclonal or 
monoclonal antibodies which bind to a product such as a protein or protein fragment 
of the second aspect. The agents identified in tlie present method may be usefiil in 
determining susceptibihty to corneodesmosin-mediated disease, or in the diagnosis, 

25 prognosis or treatment of said disease. 

In a mnth aspect of the present invention, there is provided a method of diagnosmg, oi 
determining susceptibility of a subject to corneodesmosin-mediated disease^ said 
metliod comprising determining which allele of one or more of the polymorphisms of 
30 the invention is present in a subject. The above method may be used in diagnosing or 
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determining susceptibility of a subject to any disease in which comeodesmosin is 
implicated m the pathology, in paiticular inflammatory disease, such as psoriasis. The 
method of the ninth aspect may also he used to identify the presence of a combination 
of single nucleotide polymorpliisms in a subject which define a haplotype linlced to 
5 comeodesmosin mediated disease. The haplotype may be any particular combination 
of the above single nucleotide polymoipMsms, optionally including Icnown 
polymorphisms. Preferred haplotypes are those shown in Table 10a, the most 
preferred haplotype being B of Table 1 Oa. 

10 Any mefliod, including those known to persons skilled in the art,, may be used to 
determine which allele of one or more polymorphisms of the invention is present. 
Preferably, the method comprises first removmg a sample from a subject. More 
preferably, the method comprises isolating from a sample a polynucleotide or protein 
to determine therein which allele of one or more polymorphisms of the mvention is 

15 present. 

Any biological sample comprising cells containing nucleic acid or protein is suitable 
for this purpose. Examples of suitable samples include whole bloody semen, saliva, 
teai-s, buccal, sldn or hair. For analysis of cDNA, mRNA or protein, the sample must 
20 come fi-om a tissue in which the comeodesmosin gene is expressed^ and thus it is 

preferable to use skin samples, 

Li a preferred embodiment, the method for diagnosing, or determining susceptibility of 
a subject to a corneodesmosin-mediated disease, comprises determining which allele 
25 of one or more polymorphisms of the invention is present, in a polynucleotide. Any 
method for determining alleles in a polynucleotide may be used, including those 

Ivnown to persons skilled in the art. Preferably, the method may comprise the use of 
anti-sense polynucleotides, as defined above. Such polynucleotides may include 
sequences which are able to distinguish between alleles of one or more polymorphisms 
30 of the invention, by preferential bindmg, and sequences which hybridise under 
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stringent conditions to a region either side of a polymorphism of the invention to 
enable amplification of one or more of the polymorphisms. 

Melliock of tliis embodiment include those laiown to persons sldlled in the art, for 
5 example direct probing, allele specific hybridisation, PGR methodology including 
Allele Specific Amplification (ASA), and RFLP. 

Determination of an allele of a polymorphism using direct probing mvolves the use of 
anti-sense sequences of the third aspect of the invention. These may be prepared 

10 synthetically or by nick translation. The anti-sense probes may be suitably labelled 
using, for example, a radiolabel, enzyme label, fluoro-label, biotin-avidin label for 
subsequent visualization in, for example, a southern blot procedure. A labelled probe 
may be reacted with a sample DNA or KNA, and the areas of the DNA or RNA which 
cany complimentary sequences will hybridise to the probe, and become labelled 

15 themselves. The labelled areas may then be visualized, for example by 
autoraihogi'aphy^ 

Allele specific ampUfication (ASA) discriminates between alleles of a polymoqihism 
on the basis of primers which cany 3' nucleotides specific for a particular 

20 polymorphism. Typically, fu^st and second forward prhners are provided, wherein the 
first forward primer hybridises to one allele of a polymorphism of the invention, and 
the second forward primer comprises a mis-match at the pol;i^norphic site, thus 
preventing hybridisation. These prmiers are used in combination with a backward 
primer, which hybridises to a distal site to enable amplification of the region between 

25 a forward primer and the backward primer. As the first forward primer will only bind 
to a polymoiphic site with which it exliibits perfect complementarity, amplification of 
the region between the forward and backward primers will indicate the presence of a 
paiticiilar allele„ The second forward primer havmg a mis-match at the polymoiplric 
site will not hybridise to the particular allele of a polymorphism, and ihc absence of a 

30 amplificationL product when this primer is used indicates the absence of the 
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polymoipliism. Preferably, the forward primer will be an anti-sense sequence 
according to the tliiid aspect of the invention. Preferably, the fh^st fonvard primer will 
comprise the complement of a single nucleotide polymorphism of the invention at the 
3' most position. The backward, piimer may hybridise to any suitable portion of the 
5 corneodesmosha gene to enable amplification of the intervening region, (see, for 
example, W093/22456) 

Thus, in a prefeired embodiment there is provided a method for ciiagnosing or 
determiiiing susceptibility of a subject to comeodesmosin-mediated disease, said 

10 method comprising removing a sample firom a subject and isolating the nucleic acid 
flierefrom; contacting the sample wifli either a forward prhner which preferentially 
hybridises to one allele of one or more polymorphisms of the present invention or a 
forward piimer which comprises a mis-match at the polymorphic site and does not 
hybridise thereto, and a backward primer which hybridises to a distal site; subjecting 

15 the nucleic acid sample to amplification; and monitoring for presence of an 
amplification product which is hidicative of the presence of a particular allele of one 
or more of the polymorphisms of the invention. Preferably, a first reaction is 
performed using one of the foiwaxd primei-s, and a control reaction is flien performed 
using the other forwai'd piimer. It is envisaged that a number alleles of the single 

20 nucleotide polymorphisms of the invention may be detected in a single reaction by 
using multiple primer- pairs. Amplification products may then be distinguished by 
size, using techniques known in the ait such as gel electrophoresis, or southern 
blotting. This metliod allows the unambiguous identification of individuals 
homozygous for either allele bs well as heterozygous individuals. 

25 

"RFLP" refers to restriction fi^agment length polymoiphism and is defined as a method 
of discriminating between two alleles based upon differences in sequence which result 
in the presence or absence of a restriction enzyme recognition site. In a preferred 
embodiment of the present aspect there is provided a method for diagnosing or 
30 determining susceptibility to corneodcsmosin-mediated disease, said method 
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compiisiug removing a nucleic acid sample fi-oni a subject, and contacting with one or 
more appropriate restriction enzymes. The size of fragments produced is indicative of 
Vv^hich allele of one or 3noie single nucleotide polymoiphism according to the 
invention is piesent. An allele of a polymorpliism of the invention may naturally 
5 produce a restriction enzyme site, thus allowing for determination of its presence by 
analysis of the restriction fragments produced. In some cases, however, an allele of a 
polymorpliism does not create a restriction enzyme site, and one must be artificially 
inti^odaced. Tliis may be done by using a suitable mis-match primer, according to 
methods known in the arL 

10 

The appropriate restiiction enzyme, will, of course^ be dependent upon the 
polymotphism and lestriction site, and will include those known to persons skilled in 
the art. Preferred restiiction enzymes are hsted in Table 3 (ii), column 11, with the 
expected fragments sizes in columns 7, 8 and 9. Analysis of the digested fragments 
15 may be performed using any method in the art, for example gel analysis, or southeni 
blots„ 

Preferably, the method may first comprise the amplification of a region of the 
corneodesmosin gene containing one or more of the polymorphic sites of the 
20 invention, for example, using PGR techniques. The probes of the present invention 
may be useM for this purpose„ 

The above described methods may require amplification of the DNA sample from the 
subject, and this can be done by techniques known in the art, such as PGR (see PCR 

25 Technology: Principles and Applications for DNA Amplification (ed. H. A. Eriich, 
Freeman Press, NY 1992; PCR Protocols: A Guide to methods and Applications (eds, 
Imiis ei al. Academic press, San Diego, CA 1990); Mattila ei ah. Nucleic Acids Res. 
19 4967 (1991); Eckert et al, PCR Methods and Applications 117 (1991) and US 
Patent No 4, 683, 202. Other suitable amplification mediods include hgase chain 

30 reaction (LGR) (Wu et aL, Genomics 4 560 (1989); Landegran et aL, Science 241 
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1077 (1988)), transcription ampIijScatiou (K.woh et al, Proc Nail Acad Sci USA 86 
1 173 (1989)), self sustained sequence replication (Guatelli et al, Proc Natl Acad Sci 
USA 87 1874 (1990)) and nucleic acid based sequence amplification (NASBA)„ Tlie 
latte]' two metliods both involve isothennal reactions based on isothenrLal transcription 
5 wliich produce both single stranded RNA and double stranded 13NA as the 
amplification products, in a ratio of 30 or 100 to 1, respectively. 

It may often be desirable to identify the presence of multiple single nucleotide 
polymorphisms in a sample from a subject. This may be the case m the present 
10 invention where the comeodesmosm gene contains 39 polymorphisms, each of which 
may be indicative of a different phenotype. For tiiis purpose, nucleic acid arrays may 
be useful, as described in W095/11995- The array may contain a number of probes, 
each designed to identify one or more of the above shigle nucleotide polymorpliisms 
of the comeod-esmosin gene, as described in W095/1 1995. 

15 

In a further preferred embodiment of the ninth aspect, the method may comprise 
detemiimng wliich allele of one or more polymorphisms is present m a protein of the 
invention Any method for determining the presence of a particular form, or allele, of a 
protein is present, may be used. One such method involves the use of antibodies in 

20 diagnosing or determining susceptibility to comeodesmosin mediated disease., The 
method may comprise removing a sample from a subject, contacting the sample with 
an antibody to an antigen of a protein or protein fragments according to the second 
aspect of the present invention, and detecting bmding of the antibody to the antigen, 
wherein binding is mdicative of the presence of a particular allele or form of the 

25 protein and thus risk to comeodesmosin mediated disease. Tissue samples as 
described above are suitable for this method. 

The detection of binding of the antibody to the antigen in a sample may be assisted by 
methods known in the art, such as tiie use of a secondaiy antibody which bmds to the 
30 first antibody, or a hgand. hnmunoassays including immmiofluoresceuce assays (IF A) 
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aiid enzyme imlccd immunosorbent assays (ELISA) and immnnoblotting may be used 
to detect the presence of the antigen. Por exatnple, where ELISA. is used, ttie method 
may comprise binding the antibody to a substrate, contacting the bound antibody with 
the sample containing the antigen, contacting the above with a second antibody boimd 
5 to a detectable moiety (typically an enzyme such as horse radish peroxidase or alkaHnc 
phosphatase), contacting the above wth a substrate for the enzyme, and finally 
observing the colour change which is indicative of the presence of tlie antigen in the 
sample. 

10 In a tenth aspect of the invention, there is provided a method of treating a subject who 
has been diagnosed as having, or being susceptible to, comeodesmosiii mediated 
disease such as psoriasis. The mode of treatment will depend npon tli^e nature of the 
polymorplHsm(s) and tbe phenolypic effect, and prcfcrably comprises negating the 
effect of the disease causing polymorplnsm(s). Where a subject has been diagnosed 

15 according to the methods of the invention, treatment to negate the effect of the disease 
causing polymoiplnsm may include any suitable means.. A suitable treatment 
includes the administration of a polynucleotide sequence which hybridises, preferably 
under stringent conditions (as defined above), to flie corneodesmosin gene. Such 
polynucleotide sequences may hiclude the anti-sense sequences of t!ae third aspect. 

20 Alternatively, the treatment may comprise a polynucleotide sequence encoding die 
corneodesmosin gene or a fragment thereof, and having either a refeience or variant 
allele of a polynioipliism of the invention. Preferably, the method comprises: 
(i) determining which allele of one or more polymorphisms of the invention are 
present; and 

25 (ii) administering a polynucleotide sequence which hybridises mider stringent 
conditions to the corneodesmosin gene; or a polynucleotide sequence encoding the 
reference sequence of the corneodesmosin gene or a fragment thereof, or a 
polynucleotide sequence of the first aspect. 

30 In an alternative embodiment of diis aspect, there is provided the use of a 
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polynucleotide sequence of the tenth aspect in the manufactuie of a medicament for 
use in the diagnosis and treatment of corneodesniosm mediated disease. 

Tliis method of diagnosis and treatment may comprise determining and introducing 

5 alleles in the form of a polynucleotide or protein,, hi the above embodunents, the 
allele of a polymorphism may be detennined using any mefliod, as discussed above. 
The treatment may be introduced in flie form of a protein, or polynucleotide. Any 
suitable means for introduction of a protein may be used. Introduction of a 
polynucleotide may use gene therapy methods including those known in the art. In 

10 genej:al, a polynucleotide encoding the allele will be mtroduced into the target cells of 
a subject, usually in the form of a vector and preferably in the fonn of a 
phaitnaceutically acceptable carrier. Any suitable delivery vehicle may be used, 
including viral vectors, such as retroviral vector systems which can package a 
recombinant genome. Tlie retrovirus could flien be used to infect and deliver the 

15 polyinicleolide to the target cells. Other deliveiy techniques are also widely available, 
including the use of adenoviral vectors, adeno-associated vectors, lentiviral vectors, 
pseudotyped retroviral vectors and pox or vaccinia vnus vectors. Liposomes may also 
be used, inchiduig commercially available liposome preparations such as Lipofectin 
®, Lipofectamme ®, (GIBCO-BRL, Tnc, Gaitherburg, MD), Superfect ® (Qiagen Inc, 

20 Hilden, Germany) and Transfectam ® (Promega Biotec Inc, Madison WI). 

Tlie polynucleotide or velncle may be administered paienterally (eg, inteavenously), 
transderaially, by intramuscular injection, topically or the like. As conieodesmosin 
mediated diseases are usually manifested in the skin, topical administration is 
25 preferred. The exact amount of polynucleotide or veliicle to be administered will 
vary from subject to subject and will depend upon age, weight, general condition, and 
severity or mechanism of the disorder. 



30 



In a further aspect, the present invention piovides a kit for die detection in a subject of 
a single nucleotide polymorplnsm according to the present invention. Preferably, the 
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kit will contain polynucleotides according to the aforemeiitioned aspects, most 
preferably the anti-sense sequences of the third aspect for use as probes or prioiers; 
antibodies of the fifiii aspect; oi restriction enzymes for use in detectmg the presence 
of a polynucleotide^ protein or protein jfragment of the invention. Preferably, the kit 
5 will also comprise means for detection of a reaction, such as nucleotide label detection 
means, labelled secondary antibodies or size detection means. In yet a fiirfber 
preferred embodiment^ the polynucleotides, or antibodies may be fixed to a substrate, 
for example an array, as described in W095/1 1995. 

10 The preferred embodiments of each aspect apply to the otlrer ^pects of the invention, 
mutatis mutandis. 

The present invention will now be described by way of a non-limiting example, with 
reference to the following figures in which: 

15 

FIGURE 1 shows the nucleotide sequence of the genomic clone of the 
comeodesmosin gene, of GenBanlc Accession No. AC006163. 

FIGURE 2 shows the ammo acid sequence of the comeodesmosin protein and coding 
20 sequence therefor. 

FIGURE 3 shows tlie exon and intron structure of tlie comeodesmosm gene. 



wo 01/62788 



24 



PCT/GB01/()(>795 



Examples 

Detc mimatjon of Gene Structure 

The mRNA sequence of the comeodesniosm gene (GciiBank Accession ID 
5 >}M_001264) was used to screen the following public DNA databases: (available 
through the National Centre for Biotechnology Information website - 
http://www.ncbi.nlm.nih.gQvA : NR (Non-Rednndant DNA), HTGS (High Throughput 
Genomic Sequence), and GSS (Genome Survey Sequence). The analysis was 
performed using the BLASTN algorithm (Altschul, et al, (1990) 1 Mol Biol 
10 215;403'410)> Any genomic sequences containing the comeodesmosin gene were 
identified by their degi'ee of sequence identity. The gene structure was determined by 
compaiison of the mRNA sequence with the genomic clones. The deduced oxon-intron 
organisation of the comeodesmosin gene is presented in Figure 3. 

15 Qligottuclcotidc primer dcsian for comeodesmosin gene sequencing 

5 pairs of oligonucleotide prmicrs (SIF/SIR; S2JF/S2.1R; S2.2F/S2.2R; 
S23F/S2.3R; S2.4F/S2.4R, S2,5F/S2.5R - Table 1) were designed to 
amphfy exons 1 and 2 of the corneodesmosm gene including 350bp 5' untranslated 
20 region (UTR) and 909bp 3' UTR sequences. Oligonucleotide primer sequences were 
derived from human cluromosome 6p21 genomic DNA sequence (GenBank Accession 
AC006163). 

Table 1 : Oligonucleotide Primer DNA Sequences, 

25 



Prinier ID 


Primer Sequence 


S IF 


DCTGGGTCCCGTGGCAAGA 


SIR 


DGTCCTCTCCCGGAGTCTC 


S2.,1F 


DGGTGAGGGAGGAAGCCAAG 
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S2.1R 


DGAGCTGACGCTTTGGCCAC 


S 2,2F 


DGCCAACCAATGACAA.CTCTTACC 


S2.2R 


DGCCTCCACAGAGCTGGAC 


S2.3F 


DGGCAAATACTTCTCCAGCAACC 


S 2.3R 


DGGCCTTCTCCCATATGGGA 


S2.4F 


DCCAAGGAGAGTTACTCGACAG 


S 2.4R 


DGGCATATTGGGTGGGTTGAC 


S2.5F 


DCATCTGGAAACAGTGGCCAC 


S2.5R 


DGTCTTCCTCCTCTGTGGGAG 



Comeodesmosin gene amplificatioii 

Genomic DNA from a panel of 24 unrelated individuals was amplified using primer 
5 pairs SIF/SIR; S2.1F/S2.1R; S2.2F/S2.2R; S2.3F/S2.3R; S2.4F/S2.4R, S2.5F/S2.5R. 
lOOng genomic DNA was amplified by PGR in a total reaction volume of 25^1 
containing 50mM KCl, 20niM Tris.HCl (pH 8.4), 2niM MgCIj 200nM each dATP, 
dCTP, dGTP, dTTP, \\iM each oligonucleotide primer and 0.5 units AmpUTaq Gold 
DNA polymerase (Applied Biosystems). Reactions were thennocyclcd with an initial 
10 denaturation step of 95"C/10mins followed by 35 cycles of 94''C/30secs; 

annealing/30secs; 72"C/30secs. A final elongation step of 72''C/I0mins completed the 
amplification. Aimealing temperatnres (T„;) for specific pruner pairs are presented in 
Table 2. 

15 Table 2: Primer AnneaJinK Temperatures and Amplimer Sizes. 



Amplimer 


Primer Pairs 


Fragment size (bp) 


Tm(oC) 


1 


SIF and SIR 


495 


63 


2,1 


S2.1FandS2.1R 


610 


62 


2.2 


S2,2FandS2.2R 


619 


62 
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2.3 


S2.3Fa3idS2.3R 


621 


63 


2.4 


S2.4F and S2.4R 


532 


59 


2.5 


S2.5F aiid S2.5R 


474 


61 



Heterodu plcx analysis using DHPTX: 

5 Oligos were designed to amplify products of between 400-800bp in length fxom the 
genomic DNA of 12-25 individuals. Denaturing liigh-performance ^ liquid 
cliromatography (DHPLC) analysis was performed using the WAVE™ DNA 
fragment analysis system (Tiansgenomic) (Kuklin, et aU (1997-98) Genet Test 1(3): 
201-6.)* The temperature requii"ed for successful resolution of heteroduplcx molecules 

10 within each PGR product was detennined empirically by injecting PGR product at a 
series of increasing mobile phase teniperalnres and constracting a fragment specific 
melthrg cm-ve, A universal gradient for double stranded DNA was used to detemiine 
the appropriate acetonitrile concentration for the heteroduplex identification. For 
mutation detection, 1 -2ul aliquots of tlie PGR. reactions from each of the eleven 

15 individuals were injected onto the WAVE^^ colunm.. Mutation detection gradients 
were for four mmutes. Results were graphically visualised using the D-7000 HSM 
software (Transgenomic), 

Direct sequencing of PGR products 

20 

50400ng of PGR products were sequenced in both orientations using the DYEnamic ET 
terminator cycle sequencing prcniix kit from Amcrsham. Reactions were fractionated on 
ABI 377 automated sequencers using standard procedures. Chromatographic traces were 
analysed using the SEQUENCHER programme (Gene Codes, USA), to identify SNP 
25 positions. 
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Detection of Variant Alleles - Assay desi gn for genotyping 

The fragment sequence containing the polymorpliism was analysed for the creation or 
deletion of a natuially occuirmg restriction enzyme recognition site in response to 
5 variation, in the nucleotide sequence. If the polyinorjilrism did not result in any 
changes in restriction enzyme recognition sites then the sequence was interrogated 
with the Primer Design Mismatch Program™. This is an adaptation of the program 
described by Davidow LS ((1992) Comput Appl Biosci 8:193-194), 

10 Detection of Polymorphisms in 24 Population Controls 

The application of the approach outlined above resulted in the identification of 39 
SNPs- These are described in Table 3, in which: 

• Column 1 designates each single nucleotide polymorphism a reference number. 

15 

• Column 2 provides the positional reference of the polymoiphism with respect to 
Figure 1, together with details of the polymorphism itself. For example^ the 
reference "C6948T" indicates a substitution of the nucleotide "C" for nucleotide 
'T" at position 6984 of Figure L. 

20 

• Column 3 of (i) provides the correspondijig positional references with respect to 
the coding sequence of die corneodesmosin gene. 

• Column 4 of (i) indicates the region of the gene which the polymorphism occurs. 

25 

• Colunm 5 of (i) shows the sequence flanldng the polymorphism, the polymorphism 

itself being shown in bold type. The single nucleotide polymorphisms are deiined 
using standard lUB code. 
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• Columns 3 and 4 of (ii) show primer sequences which may be used to ampUfy a 

region of tli.e corneodesmosin gene to enable detection of the single nucleotide 
polymorphism by using restriction enzyme analysis., The amplified product size is 
shown in column 5 of (ii). 

5 

• Columns 6 to 9 of (ii) list the restriction en2ymes used to digest the iunphfied 
product, and the sizes of fragments generated by the reference, variant and 

heterozygous sequences respectively. 

10 RFLP or ASA assays were developed for all of these SNPs and the corresponding 
primers along with amplification product and digestion fragment sizes are also given 
in Table 3., Of these 39 SNPs, 9 give rise to amino acid, changes. These are shown in 
Table 4. 

15 A dditional Corneodesmosin Polymoq^hisms 

In a subsequent experiment, DNAs fi'om 96 individuals comprising 24 type lA 
psoriatics, 24 type IB psoriatics, 24 type U psoriatics iind an additional 24 healthy 
controls, were sequenced as described above using prhners designed to cover the 
20 remainder of the Corneodesmosin gene (see Table 5a) 

The sequencing reactions were carried out with 50- lOOng of PGR. products sequenced 
in both orientations using the DYEnaraic ET terminator cycle sequencing premix kit 
from Amcrsham according to the following protocol: 

25 

The PGR products were Exo/Sap treated and desalted using plO columns, prior to 
setting up the sequencing reactions in a theiinoweil plate inchiding: 



200-400ngPCR Product 
30 1 p-1 primer @ 1 Opmolmr 
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8[i.l ET Termination mix 
HaOtolOp,! 

The plates were sealed with an MJ Research Microseal film and then vortexed to mix 
5 samples, followed by a spin to ensure reaction is at the bottom of the wells. 

PGR was carried out according to tiie following protocol: 
No Predeiiaturation 
95''Ctbr30sec 
10 50 for 15 sec 

60 T for 1 min 

for 40 cycles and then hold at 10 °C until ready to purify. 

After removing the plate from the fliennocycler, the products were purified by ethanol 
15 precipitation. To each well we added 2|i,l 7 .5M ammonium acetate followed by SOp-l 
100% ethanol and incubated at room temperaliue for 10 minutes before spuming at 
4000rpm for 1 hour at room temperatiu'e. Tlie supernatant was discarded and the 
pellet w^hed with 70% ethanol before centrifugation for a further 30 minutes., The 
supernatant was discai'dcd and remaining etlianol removed gently by pipettuig using 
20 pi 0 tips before allowing the pellets to air dry. 

The samples were then resuspended m lO^il MegaBACE Loading Buffer (Molecular 
Dynamics) and transferred to a Robbins plate prior to loading onto the MegaBACE. 
Reactions were fractionated on a Molecular Dynamics MegaBACE capillary 

25 sequencer usmg standard procedures. Chroraatograpliic traces were analysed using 
the SEQUENCHER programme (Gene Codes, USA), to identify SNP positions. 
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A total of 28 novel SNPs were identified (additional to those given iii the example 
above)„ For reference, these are SNPs 6-18 and 53-67 in Table 5b. A combined list of 
Corneodesmosin. SNPs is given in Table 6, 

5 Corneodesmosin ^ene association with psoriasis 

A total of 21 SNPs (see Table 7) were genotyped in 147 families identified through a 
proband with psoriasis (a total of 499 mdividuals, of whom 233 were affected). The 
genolyping was carried out using a variety of methods (single base extension using the 
10 Snapshot kit from Amersham Phamiacia Biotech, Pyrosequencing (Ahmadian A et aL, 
Anal Biochcm 2000 280:103-10), or direct sequencing) as given in Table 7, All these 
methods used cstabhshed methodologies that are provided by the equipment 
manufacturers and/or are well known to those skilled in the art. 

15 Linkage Disequilibrium 

The extent of Imkage disequilibrium (LD) between maikers was calculated using 
genotype data from 199 mirelated, unaffected ijidividuals and is expressed as 
correlation coefficients in Table 8. This analysis shows that there is extensive linkage 
20 disequilibrium between many of the Corneodesmosin polymorphisms. 

Single point association 

Single pomt associations between each SNP and psoriasis affected status were 
25 calculated using the TRANSMIT program (Clayton MRC Biostatistics Unit, 
Cambridge) - see Table 9. Highly sigiuficant associations were observed between 

SNPs 19, 21, 23, 24, 26, 28, 30, 33, 34, 37, 38 and psoriasis. The single SNP showing 
the most significant association with psoriasis tiiat has been previously repotted is 
SNP 33 (Tazi Ahnmi R et al Hum. MoL Genet. 1999: 8 ppl 135^40; Allen MH et al, 
30 Lancet 1999: 353 ppl589-90). 
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This study has identified 9 SNPs, (19, 21, 24, 26, 28, 30, 34, 37 aiid 38) which show 
global chi-squared values greater than that seen for SNP 33, and are therefore more 
powerfully predictive of affected status. 

5 

Haplot-ype analysis 

A total of 19 SNPs were used for haplotype ajialysis (SNPs at positions 29 and 32 
were excluded due to low information content). Three common haplotypcs were 

10 identified (Table 10). Of tlie three common haplotypes, haplotype B is significantly 
associated with psoriasis. The alleles are coded alphabetically (Table 10b) such that 
tlie nucleotide first in the alphabet is given coded as 1, and the other nucleotide is 
coded as 2. Thus A is always 1, T is always 2, and G or C are coded depending on the 
other nucleotide. For example, in SNP No. 1, wliich is a C to T substitution, the 

15 presence of the C allele is coded as 1 and the presence of the T allele is coded as 2 (see 
Table 10b). In Table 10a, this means that haplotypes A and B have C residues, and 
haplotype C has a T residue at this position. For an A to C substitution, the A allele 
will be coded as 1, and the C allele as 2. In a C to G substitution, the C allele will be 
1 ajid the G allele 2. 

20 

Construction of Corneodesmosm Gene Tai-getmg Vector 

As the genetic data pointed strongly to an involvement of the Corneodesmosin gene m 
the patliophysiology of psoriasis, we decided to engineer mouse strains in wliich the 
25 mouse orfhologue of the corneodesmoski gene is loiocked out by homologous 
recombination ushig a vector construct designed to remove exon 2 of the 
Corneodesmosm gene. 

Murine Corneodesmosin genonodc clones were isolated from a mouse large insert P AC 
30 Ubrary, using mouse Corneodesmosm cDNA sequence as a probe by standard 
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teclmiques. The isolated murine Corneodesniosin genomic clones were then restriction 
mapped in the region of the Coineodesmosin gene ushig small oligonucleotide prohes 
and standard techniques. The murine genomic locus was partially sequenced to enable 
tlie design of homologous arms to clone into the targeting vector„ The murine 
5 Corneodesmosin gene is a two-exon gene, A 4 kb 5' homologous aim and a 1 kb 3' 
homologous arm where amplified by PGR and the Augment cloned into the targeting 
vector. The position of these airns was chosen to functionally disrupt the 
Corneodesmosin gene by deleting the majority of the coding sequence. A targeting 
vector was prepared, where the deleted Corneodesmosin sequence was leplaced with 
10 non-homologous sequences composed of an endogenous gene expression reporter (an 
in frame fusion with lacZ) upstream of a selection cassette composed of a self 
promoted neomycin phosphoti^ansferasc (neo) gene in the same orientation as the 
Corneodesmosin gene. 

15 Transfection and Analysis of Embryonal Stem Cells 

Embryonal stem cells (Evans MJ & Kaufinan MH Natm-e 1981 292:154-6) were 
cultured on a neomycin resistant embryonal fibroblast feeder layer grown in 
Dulbecco*s Modified Eagles mediimi supplemented with 20% Fetal Calf Serum, 10% 

20 new-born calf senim, 2 mM glutamine, non-essential amino acids, lOOfiM 2- 
mercaptoethanol and 500 u/ml leukemia inliibitoiy factor. Medium was changed daily 
and ES cells were subcultured every three days. 5.times.i0..sup.6 ES cells were 
transfected- with 5 juig of linearized plasnaid by elcctroporation (25 jiF capacitance and 
400 Volts). 24 hours following elcctroporation tiie transfected cells were cultured for 9 

25 days in medium containing 200 |ig/ml neomycin. Clones were picked into 96 well 
plates, replicated, and expanded before bemg screened by PCR to identify clones in 
which homologous recombination had occuixed between the endogenous 
Corneodesmosin gene and the targeting constmct. From 96 picked clones 45 targets 
were identified. Tlrese clones where expanded to allow replicas to be frozen and 
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sufficient high quality DNA to be prepared for Southern blot coniiimation of the 
targeting event using external 5' and 3' probes, all using standard procedures (Russ et 
^/.Nature 404:95-99). 

5 Generation of Corneodesmosin Deficient Mice 

C57BL/6 female and male mice were mated and blastocysts were isolated at 3.5 days 
of gestation.. 10-12 cells from Clone 7 (described in Example 2) were injected per 
blastocyst and 7-8 blastocysts were implanted in tire uterus of a pseudopregnaiit Fl 
iO female. Five chimeric pups were bom of which one male was 100% agouti (indicating 
cells descendent from the targeted clone). This male chimera was mated with female 
and MFl and 129 mice, and germline transmission was determined by the agouti coat 
color and by PGR genotyiiiiig respectively, 

15 Corneodesmosin knock-out mouse as a model of cpmeodesmosin-mediated disease 

Mice heterozygous for Are Corneodesmosin knockout are superficially normal 
Staining for expression of the lacZ reporter gene fiiscd to the Corneodesmosin 
promoter in the knockout construct shows clear expression m desquamating skin. We 
20 tlien genotyped surviving offspring from intercrosses of heterozygous knockout mice 
on an outbred genetic background in an attempt to isolate mice homozygous for the 
knockout 

From 44 siu-viving progeny we identified: 
25 17 wild type 

27 heterozygotes 

0 homozygous mutant. 
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Statistical analysis of these data indicate that the ratio of wild typerheterozygous 
animals coBforms to a 1:2 ratio consistent with a homozygous lethal phenotype (Chi 
square = 0.557), 

5 In keeping with this analysis, two pups found dead 24-48 hours after hirth were 
homozygous mutant Together these data indicate the Comeodesmosin deficiency in 
mice is lethal with pups dying soon after birth, most likely through dehydration as a 
result of failure to establish a penneability banier in Uie skin. 

10 We conclude from this that altermg the activity of Comeodesmosin (e.g. by 
modulating expression or altering its proteolytic processing) will be useful in 
developing models of disease in which epithelial integrity is increased (e.g. psoriasis) 
or decreased (e.g. dermatitis), and for testing novel agents for the alleviation of 
Comeodesmosin mediated disease. 
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CIaijx?s 

I.. A recombinant or isolated polynucleotide comprising a nucleic acid sequence 
5 encoding the corneodesniosin gene of Figure 1, wherein said comeodcsmosin gene 
comprises a nucleotide substitution, deletion or insertion at one or more of positions 
6984, 7068, 7077, 7107, 7164, 8884, 8906, 8931, 9538, 9607, 9608, 9647, 9667, 9745, 
9761, 9926, 9952, 9968, 10082, 10161, 10162, 10363, 11567, 11641, 11649, 11808, 
11839, 11885, 11977, 12018, 12136, 12149, 12198, 12283, 12318, 12345, 12373, 
10 12901, 13001, 13020, 13108, 13117, 13178, 13224, 13316, 13365, 13562, 13605, 
13670, 13859, 138S9 and 13914 of Figure L 

2. A recombinant or isolated polynucleotide comprising a nucleic acid sequence 
encoding a fragment of the comeodesTOositi gene of Figure 1, wherein said fragment 

1 5 comprises a nucleotide substitution, deletioii or insertion according to claim 1 . 

3. A recombinant or isolated polynucleotide comprising a nucleic acid sequence 
which hybridises under stringent conditions to the corneodesmosin gene. 

20 4. A recombinant or isolated polynucleotide according to claim 3, wherein said 
nucleic acid preferentially hybridises to one allele of one or more of the 
polymorphisms of claim 1.. 

5. A recombinant or isolated polynucleotide according to claim 3 wherein Uie 
25 nucleic acid sequence hybridises under stringent conditions to a region of the 

corneodesmosin gene flanking one or niore of the polymorphisms of claim 1 , 

6. A vector comprising a polynucleotide accordiiig to any one of claims 1 to 5. 

30 7. A host cell comprising a polynucleotide or vector according to any one of 
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claims 1 to 6. 

8. A piotem comprising llie amino acid sequence of Figure 2 and having an 
amino acid substitution, deletion or insertion at one or more of positions 18, 130 or 

5 1 80 of .Figure 2, or a fi'agment thereot 

9. An antibody or antibody fragment wliich binds to a protein or protein fragment 
according to claim 8„ 

10 10, A transgenic non-lnnnan animal comprising a polynucleotide sequence 
according to any one of claims 1 to 6. 

11. Use of a transgenic non-human animal according to claim 10 in screening for 
agents for use in diagnosis or treatment of individuals having, or being susceptible to, 
1 5 corneodesmosin mediated disease. 

12-, A method of screening for an agent for use in the prognosis, diagnosis or 
treatment of individuals having or being susceptible to corneodesmosin mediated 
disease, said method comprising contacting a putative agent and with a polynucleotide 
20 or protein according to claims 1 to 5, or monitoring the reaction there between. 

13, A method of screening for an agent according to claim 13, furtlier comprising 
contacting a putative agent with a polynucleotide or protein of figures 1 or 2 
respectively; and comprising tJie reaction between 
25 (i) the agent and polynucleotide or protem of claim 1 to 5, or 8; and 
(ii) the agent and a polynucleotide or protein of Figures 1 or 2. 

A method of diagnosing for, or determining susceptibility to, corneodesmosin 
mediated disease, comprising deteimining which allele of one or more poly^norphisms 
30 of claim 1 or 7 are present in a subject. 
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15, A method according to claim 14, said method comprising detconining in a 
protein or protein &agraent which allele of one ore more polymoiphisms of claun 8 are 
present in a subject. 

5 

16. A mefliod for diagnosing and treating comeodesinosin mediated disease in a 
subject, comprising 

(i) deteiinimng which allele of one or more pol^miorphisms of the invention are 

present; aiid 

10 (ii) administering a polynucleotide sequence wliich hybridises under stringent 
conditions to the comeodesmosin gene; or a polynucleotide sequence encoding the 
reference sequence of tiie comeodesmosin gene or a Iragment thereof^ or a 
polynucleotide sequence of the first aspect, 

15 17. Use of a polynucleotide sequence according to claim 16 in the manufacture of 
a medicament for use in the diagnosis and treatment of comeodesmosin mediated 
disease. 

18. A kit for use in diagnosis of an individual having, or being susceptible to, 
20 comeodesmosin mediated disease, said Idt comprising an agent for detection of a 
polynucleotide, protein or protein fragment according to any one of claims 1 to 5 or 7 

or 8, together with a key correlating the alleles of one or more polymorphisms with 
presence of, or susceptibility to^ comeodesmosin mediated disease.. 

25 19. A kit according to claim 18 wherein the agent comprises a polynucleotide 
according to claim 3, an antibody according to claim 9 or restriction en2ymes for 
digestion of a polynucleotide according to claim 1 or 2. 
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FIG. 1 



Genomic Sequence of Come odesmo sin Gene 

6701 TGGAGGGCAG ZVTGGAGAGAC AGGCCAAGCC ACGGTAGGCA GGAGAGTTAA 
6751 GGAGCCAGGC AGGTGGGTCC CGTGGCAAGA GTGGCCGCCC CAGAGTGGGT 
6801 GGCCGTGGGG CAGAGCGCCT GGTTCCGGGT TAGGCAATGA GGAGCCGGGG 
6851 CCAGGCCTGT CAGGTGGCAG GATCGTTAGA GCCCCGTGGC CATGGGTACC 

6901 CCACACTGCA GCCACTGCTG CTGCTGAGTA GGCAGATGCA CCGGGCTGAT 



7051 ^^^^^^^^^^^^l§^^^^^^^^^^^^^^ 

7101 GTGCAGCCCG AGATGGGCTC GTCTCGGGCA CCCTGGATGG GGCGTGTGG G 

7151 TGGGCACGGG ATGATGGCAC TGCTGCT6GC TGGTCTCCTC CTGCCAGG TA 

720 a GGAGGCTGGG GGCCCTGGGA ACAGGAGGGA GGCGGGAGGG AGACTCCGGG 

7251 AGAGGACCCA GCGAAGGGGA CGGGCAGGGG CTCrGGAATC TGCCTTTTGA 

7301 GTCTGGGGGT TGCTCCTGAC TGTATGGTCG CCTCAGGT^A GTTTCTTAAA 

7351 CTTCCTGAGC CCCAGTTTCT GAAATTCTGA AGTGGGGTTA ATGACACCTA 

7401 CCTCTAGTCT GTGTGTCTCA AATTAAATAA TGTATGTGAT 7VTGTACTTTG 

7451 GAAATTCTS.G AGGTTTATAT AAATGGTGGT GGTGATTTTT ATTATGGGAG 

7501 CACTACAAGA TAATGATTGG ACATTTAATA GTAATAATAT CATTTTTAGA 

7551 GCCTTTTTAT ATGCTAGACT CTGTTTTAAG CACATTTGGA TTATATATTA 

7601 GAACTTTTAT TTTTATTTTT TTTGTGAGAT GGAGTCCCAC TCTGTCTCCA 

7651 AGGCTGGAGT GCAGTGGCGT AATCTCGGCT CACTGCAACT TCCACCTCTC 

7701 AGGTTCAAGC GACTCTCATG CCTCAGCCTC TAGAGTAGCT GGGACAACAG 

7751 GTGCCCATCA CCACACCTGG CTAATTTTCT TTTTTTXGTA TTTTTAGTAG 

7801 AAACAGGGTT TTACCATTTT GGTCAAGCTG GTCTTGAACT CCTGACTCAA 

7851 GTGATCCGCT CGCCTCGGCC TCCCAAGGTG CTGGGATTAC AGGCATGAGC 

7901 CACCACACCC GGCCTATATT AGCACTTTTG ATCATTACAA GAACGGTATG 
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7951 

8001 

8051 

8101 

8151 

8201 

8251 

8301 

8351 

8401 

8451 

8501 

8551 

8601 

8651 

8701 

8751 

8801 

8851 

8901 

8951 

9001 

9051 

9101 

9151 

9201 



AAAAGKGATT TGCmTTTCC ACTCTACAGA TGAGGACACT GAGGCTCGGA 
GAGGTTAGGA AACTAGCTCA 7\AATCATGCA TTAGAAGGCA GCAAAGCCAA 
GATTTCAACC CCAGGCCAGG CAACCCCTGG ACCTGTGTTG TTGAGCACCG 
GGTACTTATA GCCCTTGAGG AATTTCTGCG ACCTTCCCAT GGTCTAGTGG 
GTGGTTGGTG TCTGAGGGAA TAGCGAAAGA GAGAGGCAAT GCATGGTGGA 
TTCGTGCAGA GGACTGAAGG GAATTGGCAC AGCTGGGGTT CGGCGTGGAG 
GTGCATGCAG AGAATTTCTT TCTGAGGAGA GAACAGGGAC ATCACAGAGG 
ATGGCAGTCT GGTTGTTGGT GGAGGGATCA GGATGAGTGG CAGTAATAAT 
TCATAATATA TAATGCTTTA CACTTTCTAA AACATCTGGC CGCACATGAT 
AGCTTGTGCC TGTAArCCCA ACACTTCAGG AGGCCAAGGC AGGTGAATCG 
CCTGAGGTCA GGAGf TCAAG ACCAGCCTGG CCA?\GATGGT GAAACCCCCT 
CTCTACTAAA AATACAAAAA ATTAGCTGGG TGTGGTGGCG GGCACCTGTG 
GTCCCAGCTA CTTGGGAGGC TGAGGCAGGA GAATCGCTTG CACCAAGGAG 
GCAGAGGTTA CAGTGAGCTG AGACCGTGTT ATTGCBCTTT AGCCTGGGCA 
ACAAGAAACT CCATCTCACA AAZVAAA7VAAA AAAAAAAAAA AAAGAAGAAA 
7VMCTTCCAG GTGGATGATC TCATTTAGTT TTCTTCATAG TAATGCTGTG 
GGAAGGCAGG GAAAATTTGG CCCCTCTGAA TGTATAAACT AAAGCTCAGA 
GAGGTTCAGT AACTTGCTAG TATGTGGCTC TGTTTGTAAC ACGTGGGACC 
TGGAGGGGCT AGGGAAGGCA GAAGGAACGC AGGTGAAAGA GTCATGGAGG 
AACCATGGGG TAAGTTGGGC CTGGGGTTTT GAGCAAAGGA AAGGAAAGAT 
AAGGAAAGAT GTGGCTCCAC ATCCCTGAGG GAAGTCAAGG CAGCAGAAGT 
CAGATGAGGG GCTGGACAGA GGCAGGTGTG CTCAGAGAGG GAAGCTGATT 
GTGGCCAGGA GCCTCGGAGG TTCGTGGGGT TTCGTCCTGG TTCCCTGGGC 
TGGGCCAGCG AGAGCAGGGC TGGCTCAGGG TGCGGTGTCC TGACACACTG 
GTACCAGCAG GTTCTGA?^.GC AACAGGTAGT GACCCCACAT CCTGGCCCCC 
ACCCAGCTTT ACTGGCATGG CCAGTGCTGA GATAGGAAAT AGGGTTTCCA 
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9251 


TTCCTGACCC 


CAGCCTGGGC 


TCTCACGAAG 


AAGCTGGTGA 


CCAAaTCTTA 


9301 


GTCCTCGAGT 


GCCCTTTCCT 


TTATTTCAGC 


CCCTCTGCCC 


CCAGCTTl'GT 


9351 


CTTTTTCCAG 


TGTCTCCTTC 


TATATGTGTC 


rn. m rn /~i m 

rCCACTTCTC 


AGCCCTCCAT 


9401 


TGTTTTGCCT 


TTTGTCTTCT 


TCCCTCTGGT 


CCGACTGTCT 


GGCCCAGGAT 


9451 


TTTTCCCCTA AGAATTTACG 


CCTGGACTCC 


TCAGAGCCTC 


AGTTTCCCCA 


9501 


ATTCTCTGTC 


TCTTCAGGGT 


CCTTTCTTTT 


AGACCTATTT 


GTTCCTGCCC 


9551 


CTTCTCCATT 


CCCTCTTCTT 


fn m m "h ^ t\ *a 'a 

TTTAAAAAAA 


ATTTTAATTA 


Tl "r\ Tl "7^ "Tl TL 'A 

AAAAACA?\AA 


9601 


TACAGATGGG 


GTCTATGTTG 


CCCAGGCTGG 


TCTTGT^CTC 


TGGGGCGCAi 


9651 


GCAATCCTCC 


CACCTCAGCG 


TCCCAAAGTG 


CTGGGATTAC 


CGGCGTGAGC 


9701 


CACTGTGCCC 


AGCCCCCTCT 


TArAi TCAAT 


GlATiCCjTTT 


GAGGTCAGIC 


9751 


ACTTTGGCAC GTAATTTTCT 


ATTTTTCTGG 


TTGGTGTTTG 


CCCACCCTTC 


9801 


CCAAACAAAG AAATGCCTTT 


ATTCGGCCAC 


CTCMTATCC 


TTTAGAGACA 


9851 


ATAGCCAGTT 


CrTCCTCCTl' 


TCTCCATCCC 


taa:actctcc 


CTGCGCTCTG 


9901 


CTTGGGAGAA 


ACCCGAGAGG 


CCGATTACTG 


AGATAAGGCA 


GAAAGGTGAG 


9951 


GGAGGAAGCC 


AAGCCTCCTT 


GGCCCTTACT 


AACCACTGCT 


TTCCTCCACA 


lOOOX 


GGGACCTTGG 


CTAAGAGCAT 


TGGCACCTTC 


TCAGACCCCT 


GTAAGGACCC 


10051 


CACGCGTATC 


ACCTCCCCTA 


ACGACCCCTG 


CCTCACTGGG 


A7\GGGTGACT 


10101 


CCAGCGGCTT 


CAGTAGCTAC 


AGTGGCTCCA 


GCA6TTCTGG 


CAGCTCCATT 


10151 


TCCAGTGCCA GA?\GCTCTGG 


TGGTGGCTCC 


AGTGGTAGCT 


CCAGCGGATC 


10201 


CAGCATTGCC 


CAGGGTGGTT 


CTGCAGGATC 


TTTT7VAGCCA 


GGAACGGGGT 


1025,1 


ATTCCCAGGT 


CAGCTACTCC 


TCCGGATCTG 


GCTCTAGTCT 


ACAAGGTGGA 


10301 


TCCGGTTCCT 


CCCAGCTGGG 


GAGCAGCAGC 


TCTCACTCGG 


GAAGCAGCGG 


1 A "5 C T 

lOJbi 


CTCTCACTCG 


GGAAGCAGCA 






J lOjs-'^'OVj i J, J. ^ 


10401 


AGTTCAGCAG 


CAGCAGCTTC 


CAAGTAGGGA 


ATGGCTCTGC 


TCTGCCAACC 


10451 


AATGACAACT 


CTTACCGCGG 


AATACTAMC 


CCTTCCCAGC 


CTGGACAAAG 


10501 


CTCTTCCTCT 


TCCCAAACCT 


CTGGGGTATC 


CAGCAGTGGC 


CAAAGCGTCA 






FIG. 


1 CONT'D 










1 ir-^r^-rf-ri t-rt^ 
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10551 GCTCCAACCA GCGTCCCTGT AGTTCGGACA TCCCCGACTC TCCCTGCAGT 



10601 


GGAGGGCCCA TCGTCTCGCA CTCTGGCCCC 


TACATCCCCA 


GCTCCCACTC 


10651 


TGTGTCAGGG 


GGTCAGAGGC 


CTGTGGTGGT 


GGTGGTGGAC 


CAGCACGGTT 


10701 


CTGGTGCCCC 


TGGAGTGGTT 


CAAGGTCCCC 


CCTGTAGCAA 


jrGGTGGCCTT 


10751 


CCAGGCAAGC 


CCTGTCCCCC 


AATCACCTCT 


GTAGACAAi^T 


CCTATGGTGG 


10801 


CTACGAGGfG 


GTGGGTGGCT 


CCTCTGACAG 


TTATCTGGTT 


CCAGGCATGA 


10851 


CCTACAGTAA 


GGGTAAAArc 


TAa?CCTGrGG 


GCTACTTCAC 


CAAAGAGA?iC 


10901 


CCTGTGAAAG 


GCTCTCCAGG 


GGTCCCTTCC 


TTrGCAGCTG 


GGCCCCCCAT 


10951 


CTCTGAGGGC AAATACTTCT 


CCAGCAACCC 


CATCATCCCC 


AGCCAGTCGG 


11001 


CAGCTTCCTC 


GGCCATTGCG 


TTCCAGCCAG 


TGGGGACTGG 


TGGGGTCCAG 


11051 


CTCTGTGGAG 


GCGGCTCCAC 


GGGCTCCAAG 


GGACCCTGCT 


CTCCCTCCAG 


11101 


TTCTCGAGTC 


CCCAGCAGTT 


CTAGCATTTC 


CAGCAGCtrCC 


GGTTCACCCT 


11151 


ACCATCCCTG 


CGGCAGTGCT 


TCCCAGAGCC 


CCTGCTCCCC ACCAGGCACC 


11201 


GGCTCCTTCA 


GCAGCAGCTC 


CAGTTCCCAA TCGAGTGGCA AAATCATCCT 


11251 


TCAGCCTTGT 


GGCAGCAAGT 


CCAGCTCTTC 


TGGTCACCCT 


TGCATGTCTG 


11301 


TCTCCTCCTT 


GACACTGACT 


GGGGGCCCCG 


ATGGCTCTCC 


CCATCCTGAT 


11351 


CCCTCCGCTG 


GTGCCAAGCC 


CTGTGGCTCC 


AGCAGTGCTG 


GAAAGATCCC 


11401 


CTGCCGCTCC 


ATCCGGGATA 


TCCTAGCCCA 


AGTGMGCCT 


CTGGGGCCCC 


11451 


AGCTAGCTGA 


CCCTGAAGTT 


TTCCTACCCC 


AAGGAGAGTT 


ACTCGACAGT 



11501 CCOTA^iiB§§^\^ 

11601 fScAGi^pce^^ 

11651 liiSMe^Siifiii^ 

117 51 IJ^^g^lg^^gag^ 

11801 ^^^^M^^^^^^^^^^^i^^: ^^^S^M^^^M^Wm^ 
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^^^01 ilSiiiiilM^PililiPil^^ 

12001 PSiiiMis^^i^i^siisi?^^ 

12051 

; 12101 W^MM^^^W^^l^^ 

12151 iiii^siili^liiSl^ 

12251 iiiSilill^^il^ii^^ 

12301 ligsiiiiMiiiiiMM^ 

FIG. 1 CONT'D 
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Ij^.B'^'S^^gLii^ji^^ Sxon Secjuence SNP 
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FIG. 2 

CORNEODESMOSIKT ( af 0 3 0 13 0 , em_„huiiil) 



T 

ATGGGCTCGTCTCGGGCACCCTGGATGGGGCGTGTGGGTGGGCACGGGATGATGGCACTG 

15 + -h 4 + + 74 

TACCCGAGCAGAGCCCGTGGGACCTACCCCGCACACCCACCCGTGCCCTACTACCGTGAC 

HGSSRAPWMGRVGGHGMHAL 

L 

CTGCTGGCTGGTCTCCTCCTGCCAGGGACCTTGGCTAAGAGCATTGGCACCTTCTCAGAC 

75 + ^ + 4 h 4 134 

GACGACCGACCAGAGGAGGACGGTCCCTGGAACCGATTCTCGTAACCGTGGAAGAGTCTG 

LLAGIiLIiPGTIiAKSIGTFSD 

T T 
CCCTGTAAGGACCCCACGCGTATCACCTCCCCTAACGACCCCTGCCTCACTGGGAAGGGT 

135 + + -^ -f + 4- 194 

GGGACATTCCTGGGGTGCGCATAGTGGAGGGGATTGCTGGGGACGGAGTGACCCTTCCCA 

PCKDPTRITSPNDPCLTGKG 

S 

T 

GACTCCAGCGGCTTCAGTAGCTACAGTGGCTCCAGCAGTTCTGGCAGCTCCATTTCCAGT 

195 f ^ + + + i 254 

CTGAGGTCGCCGAAGTCATCGATGTCACCGAGGTCGTCAAGACCGTCGAGGTAAAGGTCA 

DSSGFSSYSGSSSSGSS13S 



GCCAGAAGCTCTGGTGGTGGCTCCAGTGGTAGCTCCAGCGGATCCAGCATTGCCCAGGGT 

255 h-" + ^ + ^ ^ 314 

CGGTCTTCGAGACCACCACCGAGGTCACCATCGAGGTCGCCTAGGTCGTAACGGGTCCCA 

ARSSGGGSSGSSSGSSIAQG 



GGTTCTGCAGGATCTTTTAAGCCAGGAACGGGGTATTCCCAGGTCAGCTACTCCTCCGGA 

315 + -i^ 4 + + + 374 

CCAAGACGTCCTAGAAAATTCGGTCCTTGCCCCATAAGGGTCCAGTCGATGAGGAGGCCT 

GSAGSPKPGTGYSQVSYSSG 



TOTGGCTCTAGTCTACAAGGTGCATCCGGTTCCTCCCAGCTGGGGAGCAGGAGCTCTCAC 

375 + + + 4-^^ V 4^-^-^ 434 

AGACCGAGATCAGATGTTCCACGTAGGCCAAGGAGGGTCGACCCCTCGTCGTCGAGAGTG 

SGSSLQGASGSSQLGSSSSH 
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A del 

TCGGGAAGCAGCGGCTCTCACTCGGGAAGCAGCAGCTCTCATTCGAGCAGCAGCAGCAGC 
435 1 + ^ + , ^ 

AGCCCTTCGTCGCCGAGAGTGAGCCCTTCGTCGTCGAGAGTAAGCTCGTCGTCGTCGTCG 

SGSSGSRSGSSSSESSSSSS 
U del 

TTTCAGTTCAGCAGCAGCAGCTTCCAAGTAGGGAATGGCTCTGCTCTGCCAACCAATGAC 

495 + + + + + 554 

AAAGTCAAGTCGTCGTCGTCGAAGGTTCATCCCTTACCGAGACGAGACGGTTGGTTACTG 

FQ FSSfSSFQVGWGSALPTND 

G 

AACTCTTACCGCGGAATACTAAACCCTTCCCAGCCTGGACAAAGCTCTTCCTC'PrCCCAA 

555 -V + \ ■ + 614 

TTGAGAATGGCGCCTTATGATTTGGGAAGGGTCGGACCTGTTTCGAGAAGGAGAAGGGTT 

NSYRGILNPSQPGQSSSSSQ 

T 

ACCTCTGGGGTATCCAGCAGTGGCCAAAGCGTCAGCTCCAACCAGCGTCCCTGTAGTTCG 
615 4 + + 4. + + 

TGGAGACCCCATAGGTCGTCACCGGTTTCGCAGTCGAGGTTGGTCGCAGGGACATCAAGC 

TSGVSSSGQSVSSWQRPCSS 
F 

C 

GACATCCCCGACTCTCCCTGCAGTGGAGGGCCCATCGTCTCGCACTCTGGCCCCTACATC 

675 4 h ^ + + 734 

CTGTAGGGGCTGAGAGGGACGTCACCTCCCGGGTAGCAGAGCGTGAGACCGGGGATGTAG 

DIPDSPCSGGPI VSHSGPYI - . 

S 

A 

CCCAGCTCCCACTCTGTGTCAGGGGGTCAGAGGCCTGTGGTGGTGGTGGTGGACCAGCAC 

735 + + )- + -f + 794 

GGGTCGAGGGTGAGACACAGTCCCCCAGTCTCCGGACACCACCACCACCACCTGGTCG'TG 

PSSHSVSGGQRPVVVVVDQ H 



GGTTCTGGTGCCCCTGGAGTGGTTCAAGGTCCCCCCTGTAGCAATGGTGGCCTTCCAGGC 
CCAAGACCACGGGGACCTCACCAAGTTCCAGGGGGGACATCGTTACCACCGGAAGGTCCG 
GSGAPGVVQGPPCSNGGIjPG 



FIG. zlCONTD 
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AAGCCCTGTCCCCCAATCACCTCTGTAGACAAATCCTATGGTGGCTACGAGGTGGTGGGT 



855 " 




914 



KPCPPI T SVDKSYGGYEVVG 



C 



GGCTCCTCTGACAGTTATCTGGTTCCAGGCATGACCTACAGTAAGGGTAAAATCTATCCT 

9X5 + + H + + ~h 974 

CCGAGGAGACTGTCZiATAGACCAAGGTCCGTACTGGATGTCATTCCCATTTTAGATAGGA 

GSSDSYLVPGMTYSKGKIYP' 



GTGGGCTACTTCACCAAAGAGAACCCTGTGAAAGGCTCTCCAGGGGTCCCTTCCTl'TGCA 

975 + H + + + i- 1034 

CACCCGATGAAGTGGTTTCTCTTGGGACACTTTCCGAGAGGTCCCCAGGGAAGGAAACGT 

VGYFTKEWPVKGSPGVPSFA 



GCTGGGCCCCCCATCTCTGAGGGCAAATACTTCTCCAGCAACCCCATCATCCCCAGCCAG 

103 5 ■^ + + 1 + + 1094 

CGACCCGGGGGGTAGAGACTCCCGTTTATGAAGAGGTCGTTGGGGTAGTAGGGGTCGGTC 

AGPPISEGKYFSSNPIIPSQ 

A 

TCGGCAGCTTCCTCGGCCATTGCGTTCCAGCCAGTGGGGACTGGTGGGGTCCAGCTCTGT 

1095 + 4- + 4' + 1154 

AGCCGTCGAAGGAGCCGGTAACGCAAGGTCGGTCACCCCTGACCACCCCAGGTCGAGACA 

SAASSAIAFQPVGTGGVQLC - 



GGAGGCGGCTCCACGGGCTCCAAGGGACCCTGCTCTCCCTCCAGTTCTCGAGTCCCCAGC 
1155 H + f + 1214 

CCTCCGCCGAGGTGCCCGAGGTTCCCTGGGACGAGAGGGAGGTCAAGAGCTCAGGGGTCG 

GGGSTGSKGPCSPSSSRVPS 
G T T T 

AGTTCTAGGATTTCCAGCAGCGCCGGTTCACCCTACCATCCCTGCGGCAGTGCTTCCCAG 

1215 + + ^ + + ^- 1274 

TCAAGATCGTAAAGGTCGTCGCGGCCAAGTGGGATGGTAGGGACGCCGTCACGAAGGGTC 

SSSISSSA6SPYHPCGSASQ 



S V L 
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C 

AGCCCCTGCTCCCCACCAGGCACCGGCTCCTTCAGCAGCAGCTCCAGTTCCCAATCGAGT 

1275 H -I + + 1 1334 

TCGGGGACGAGGGGTGGTCCGTGGCCGAGGAAGTCGTCGTCGAGGTCAAGGGTTAGCTCA 

SPCSPPGTGSFSSSSSSQSS 

c 

GGCAAAATCATCCTTCAGCCTTGTGGCAGCAAGTCCAGCTCTTCTGGTCACCCTTGCATG 

1335 -r h + + -i + 1394 

CCGTTTTAGTAGGAAGTCGGAACACCGTCGTTCAGGTCGAGAAGACCAGTGGGAACGTAC 

GKIILQPCGSKSSSSGHPCM 



TCTGTCTCCTCCTTG/^CACTGACTGGGGGCCCCGATGGCTCTCCCCATCCTGATCCCTCC 

1395 H + + 1 ^ 1454 

AGACAGAGGAGGAACTGTGACTGACCCCCGGGGCTACCGAGAGGGGTAGGACTAGGGAGG 

SVSSLTIiTGGPDGSPHPDPS 



GCTGGTGCGAAGCCCTGTGGCTCCAGCAGTGCTGGA?IAGATCCCCTGCCGGTCCATCCGG 

1455 H ^ + + + 1514 

CGACCACGGTTCGGGACACCGAGGTCGTCACGACCTTTCTAGGGGACGGCGAGGTAGGCC 

AGAKPCGSSSAGKIPCRSIR 



GATATCCTAGCCCAAGTGAAGCCTCTGGGGCCCCAGCTAGCTGACCCTGAAGTTTTCCTA 

1515 + + + + + + 1574 

CTATAGGATCGGGTTCACTTCGGAGACCCCGGGGTCGATCGACTGGGACTTCAAAAGGAT 

DILAQVKPLGPQLADPEVFL 



A 

CCCCAAGGAGAGTTACTCGACAGTCCATAA 

1575 + ^ + 1604 

GGGGTTCCTCTCAATGAGCTGTCAGGTATT 

P Q G E L L D S P 



FIG. ^ICONTD 
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